智能论文笔记

Center-Embedding and Constituency in the Brain and a New Characterization of Context-Free Languages

Daniel Mitropolsky , Adiba Ejaz , Mirah Shi , Mihalis Yannakakis , Christos H. Papadimitriou

分类：自然语言处理

2022-06-27

最近显示出一种仅通过神经元的尖峰实现的计算系统，即语法，即进行简单的英语句子的依赖性解析。我们解决了这项工作所留下的两个最重要的问题：选区（句子的关键部分，例如动词短语）和处理依赖句子的处理，尤其是中央句子。我们表明，语言的这两个方面也可以由神经元和突触以与已知或被广泛相信的语言器官的结构和功能兼容的方式来实现。令人惊讶的是，我们实施中心嵌入的方式指出了无上下文语言的新表征。

translated by 谷歌翻译

Planning with Biological Neurons and Synapses

Francesco d'Amore , Daniel Mitropolsky , Pierluigi Crescenzi , Emanuele Natale , Christos H. Papadimitriou

分类：人工智能 | 神经与进化计算

2021-12-15

我们重新审视块世界中的规划问题，我们为此任务实施了一个已知的启发式。重要的是，我们的实施是生物学上可言论的，因此它仅通过神经元的尖峰进行。尽管在过去五十年中，在块世界中已经在块世界中完成了很多，但我们认为这是它的第一个算法。输入是编码初始块堆栈以及目标集合的符号序列，并且输出是一系列运动命令，例如“将顶部块放在桌子上的堆栈1中”。该程序是在组装微积分中写入的，最近提出的计算框架通过弥合神经活动与认知功能之间的差距来模拟大脑中的计算。其基本对象是神经元的组件（稳定的神经元组，其同时射击表示该主题正在考虑对象，概念，单词等），其命令包括项目和合并，其执行模型基于广泛接受的原则神经科学。该框架中的一个程序基本上建立了神经元的动态系统和最终具有高概率，实现任务的神经元和突触。这项工作的目的是凭经验建立了大会微积分中的合理大计划可以正确可靠地执行;这相当现实 - 如果理想化 - 更高的认知功能，例如在块世界中规划，可以通过这些程序成功实现。

translated by 谷歌翻译

Assemblies of neurons learn to classify well-separated distributions

Max Dabagia , Christos H. Papadimitriou , Santosh S. Vempala

分类：神经与进化计算 | 人工智能 | 机器学习

2021-10-07

组件是大量的神经元，其同步射击被假设代表记忆，概念，单词和其他认知类别。据信，组件可以在高级认知现象和低级神经活动之间提供桥梁。最近，已经显示出一种称为“大会微积分（AC）”的计算系统，其曲目在集会上具有生物学上合理的操作，能够模拟由任意空间的计算模拟，但也可以模拟复杂的认知现象，例如语言，推理和规划和计划。但是，尚不清楚组装可以调解学习的机制。在这里，我们提出了这样一种机制，并严格证明，对于标记组件的分布定义的简单分类问题，可以可靠地形成代表每个类别的新组装，以响应类中的一些刺激。因此，该组件是对同一类的新刺激的响应可靠地召回的。此外，只要相应的类是相似组件的簇时，这些类组件就可以区分。为了证明这些结果，我们利用具有动态边缘权重的随机图理论来估计激活顶点的序列，在过去五年中对该领域的先前计算和定理产生了强烈的概括。这些定理通过实验证明了组件的成功形成，这些定理代表了从此类分布中绘制的合成数据以及MNIST上的概念类别的形成，并在MNIST上，这可以通过每个数字的一个组装来分类。该机制被视为一种学习算法，完全是在线上，从很少的样本中概括，只需要轻度的监督 - 在大脑模型中学习的所有关键属性。

translated by 谷歌翻译

QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation -- Analysis of Ranking Metrics and Benchmarking Results

Raghav Mehta , Angelos Filos , Ujjwal Baid , Chiharu Sako , Richard McKinley , Michael Rebsamen , Katrin Dätwyler , Raphael Meier , Piotr Radojewski , Gowtham Krishnan Murugesan

分类：计算机视觉 | 机器学习

2021-12-19

深度学习（DL）模型为各种医学成像基准挑战提供了最先进的性能，包括脑肿瘤细分（BRATS）挑战。然而，局灶性病理多隔室分割（例如，肿瘤和病变子区）的任务特别具有挑战性，并且潜在的错误阻碍DL模型转化为临床工作流程。量化不确定形式的DL模型预测的可靠性，可以实现最不确定的地区的临床审查，从而建立信任并铺平临床翻译。最近，已经引入了许多不确定性估计方法，用于DL医学图像分割任务。开发指标评估和比较不确定性措施的表现将有助于最终用户制定更明智的决策。在本研究中，我们探索并评估在Brats 2019-2020任务期间开发的公制，以对不确定量化量化（Qu-Brats），并旨在评估和排列脑肿瘤多隔室分割的不确定性估计。该公制（1）奖励不确定性估计，对正确断言产生高置信度，以及在不正确的断言处分配低置信水平的估计数，（2）惩罚导致更高百分比的无关正确断言百分比的不确定性措施。我们进一步基准测试由14个独立参与的Qu-Brats 2020的分割不确定性，所有这些都参与了主要的Brats细分任务。总体而言，我们的研究结果证实了不确定性估计提供了分割算法的重要性和互补价值，因此突出了医学图像分析中不确定性量化的需求。我们的评估代码在HTTPS://github.com/ragmeh11/qu-brats公开提供。

translated by 谷歌翻译

Fine-Grained Hard Negative Mining: Generalizing Mitosis Detection with a Fifth of the MIDOG 2022 Dataset

Maxime W. Lafarge , Viktor H. Koelzer

分类：计算机视觉

2023-01-03

Making histopathology image classifiers robust to a wide range of real-world variability is a challenging task. Here, we describe a candidate deep learning solution for the Mitosis Domain Generalization Challenge 2022 (MIDOG) to address the problem of generalization for mitosis detection in images of hematoxylin-eosin-stained histology slides under high variability (scanner, tissue type and species variability). Our approach consists in training a rotation-invariant deep learning model using aggressive data augmentation with a training set enriched with hard negative examples and automatically selected negative examples from the unlabeled part of the challenge dataset. To optimize the performance of our models, we investigated a hard negative mining regime search procedure that lead us to train our best model using a subset of image patches representing 19.6% of our training partition of the challenge dataset. Our candidate model ensemble achieved a F1-score of .697 on the final test set after automated evaluation on the challenge platform, achieving the third best overall score in the MIDOG 2022 Challenge.

translated by 谷歌翻译

MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding

Steven H. Wang , Antoine Scardigli , Leonard Tang , Wei Chen , Dimitry Levkin , Anya Chen , Spencer Ball , Thomas Woodside , Oliver Zhang , Dan Hendrycks

分类：自然语言处理

2023-01-02

Reading comprehension of legal text can be a particularly challenging task due to the length and complexity of legal clauses and a shortage of expert-annotated datasets. To address this challenge, we introduce the Merger Agreement Understanding Dataset (MAUD), an expert-annotated reading comprehension dataset based on the American Bar Association's 2021 Public Target Deal Points Study, with over 39,000 examples and over 47,000 total annotations. Our fine-tuned Transformer baselines show promising results, with models performing well above random on most questions. However, on a large subset of questions, there is still room for significant improvement. As the only expert-annotated merger agreement dataset, MAUD is valuable as a benchmark for both the legal profession and the NLP community.

translated by 谷歌翻译

UltraProp: Principled and Explainable Propagation on Large Graphs

Meng-Chieh Lee , Shubhranshu Shekhar , Jaemin Yoo , Christos Faloutsos

分类：机器学习

2022-12-31

Given a large graph with few node labels, how can we (a) identify the mixed network-effect of the graph and (b) predict the unknown labels accurately and efficiently? This work proposes Network Effect Analysis (NEA) and UltraProp, which are based on two insights: (a) the network-effect (NE) insight: a graph can exhibit not only one of homophily and heterophily, but also both or none in a label-wise manner, and (b) the neighbor-differentiation (ND) insight: neighbors have different degrees of influence on the target node based on the strength of connections. NEA provides a statistical test to check whether a graph exhibits network-effect or not, and surprisingly discovers the absence of NE in many real-world graphs known to have heterophily. UltraProp solves the node classification problem with notable advantages: (a) Accurate, thanks to the network-effect (NE) and neighbor-differentiation (ND) insights; (b) Explainable, precisely estimating the compatibility matrix; (c) Scalable, being linear with the input size and handling graphs with millions of nodes; and (d) Principled, with closed-form formula and theoretical guarantee. Applied on eight real-world graph datasets, UltraProp outperforms top competitors in terms of accuracy and run time, requiring only stock CPU servers. On a large real-world graph with 1.6M nodes and 22.3M edges, UltraProp achieves more than 9 times speedup (12 minutes vs. 2 hours) compared to most competitors.

translated by 谷歌翻译

Detection of Groups with Biased Representation in Ranking

Yuval Moskovitch , Jinyang Li , H. V. Jagadish

分类：机器学习

2022-12-30

Real-life tools for decision-making in many critical domains are based on ranking results. With the increasing awareness of algorithmic fairness, recent works have presented measures for fairness in ranking. Many of those definitions consider the representation of different ``protected groups'', in the top-$k$ ranked items, for any reasonable $k$. Given the protected groups, confirming algorithmic fairness is a simple task. However, the groups' definitions may be unknown in advance. In this paper, we study the problem of detecting groups with biased representation in the top-$k$ ranked items, eliminating the need to pre-define protected groups. The number of such groups possible can be exponential, making the problem hard. We propose efficient search algorithms for two different fairness measures: global representation bounds, and proportional representation. Then we propose a method to explain the bias in the representations of groups utilizing the notion of Shapley values. We conclude with an experimental study, showing the scalability of our approach and demonstrating the usefulness of the proposed algorithms.

translated by 谷歌翻译

DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and Classification for Diabetic Retinopathy Grading

Hasan Md Tusfiqur , Duy M. H. Nguyen , Mai T. N. Truong , Triet A. Nguyen , Binh T. Nguyen , Michael Barz , Hans-Juergen Profitlich , Ngoc T. T. Than , Ngan Le , Pengtao Xie

分类：计算机视觉

2022-12-30

Diabetic Retinopathy (DR) is a leading cause of vision loss in the world, and early DR detection is necessary to prevent vision loss and support an appropriate treatment. In this work, we leverage interactive machine learning and introduce a joint learning framework, termed DRG-Net, to effectively learn both disease grading and multi-lesion segmentation. Our DRG-Net consists of two modules: (i) DRG-AI-System to classify DR Grading, localize lesion areas, and provide visual explanations; (ii) DRG-Expert-Interaction to receive feedback from user-expert and improve the DRG-AI-System. To deal with sparse data, we utilize transfer learning mechanisms to extract invariant feature representations by using Wasserstein distance and adversarial learning-based entropy minimization. Besides, we propose a novel attention strategy at both low- and high-level features to automatically select the most significant lesion information and provide explainable properties. In terms of human interaction, we further develop DRG-Net as a tool that enables expert users to correct the system's predictions, which may then be used to update the system as a whole. Moreover, thanks to the attention mechanism and loss functions constraint between lesion features and classification features, our approach can be robust given a certain level of noise in the feedback of users. We have benchmarked DRG-Net on the two largest DR datasets, i.e., IDRID and FGADR, and compared it to various state-of-the-art deep learning networks. In addition to outperforming other SOTA approaches, DRG-Net is effectively updated using user feedback, even in a weakly-supervised manner.

translated by 谷歌翻译

Examining Political Rhetoric with Epistemic Stance Detection

Ankita Gupta , Su Lin Blodgett , Justin H Gross , Brendan O'Connor

分类：自然语言处理

2022-12-29

Participants in political discourse employ rhetorical strategies -- such as hedging, attributions, or denials -- to display varying degrees of belief commitments to claims proposed by themselves or others. Traditionally, political scientists have studied these epistemic phenomena through labor-intensive manual content analysis. We propose to help automate such work through epistemic stance prediction, drawn from research in computational semantics, to distinguish at the clausal level what is asserted, denied, or only ambivalently suggested by the author or other mentioned entities (belief holders). We first develop a simple RoBERTa-based model for multi-source stance predictions that outperforms more complex state-of-the-art modeling. Then we demonstrate its novel application to political science by conducting a large-scale analysis of the Mass Market Manifestos corpus of U.S. political opinion books, where we characterize trends in cited belief holders -- respected allies and opposed bogeymen -- across U.S. political ideologies.

translated by 谷歌翻译